Non-volatile memory with linear estimation of initial programming voltage

ABSTRACT

In a non-volatile memory, a selected page on a word line is successively programmed by a series of voltage pulses of a staircase waveform with verifications in between the pulses until the page is verified to a designated pattern. The programming voltage at the time the page is programmed verified will be to estimate the initial value of a starting programming voltage for the page. The estimation is further refined by using the estimate from a first pass in a second pass. Also, when the test is over multiple blocks, sampling of word lines based on similar geometrical locations of the blocks can yield a starting programming voltage optimized for faster programming pages.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is related to the following U.S. patent applications:U.S. application Ser. No. 11/531,217, entitled “Method For Non-VolatileMemory With Reduced Erase/Write Cycling During Trimming Of InitialProgramming Voltage,” by Yan Li, et al., filed on Sep. 12, 2006. U.S.application Ser. No. 11/531,223, entitled “Non-Volatile Memory WithReduced Erase/Write Cycling During Trimming Of Initial ProgrammingVoltage,” by Yan Li, et al., filed on Sep. 12, 2006. U.S. applicationSer. No. 11/531,227, entitled “Method For Non-Volatile Memory WithLinear Estimation Of Initial Programming Voltage,” by Loc Tu, et al.,filed on Sep. 12, 2006, now U.S. Pat. No. 7,453,731.

FIELD OF THE INVENTION

This invention relates generally to non-volatile semiconductor memorysuch as electrically erasable programmable read-only memory (EEPROM) andflash EEPROM, and specifically to determining optimum initialprogramming voltages of various groups of memory cells.

BACKGROUND OF THE INVENTION

Solid-state memory capable of nonvolatile storage of charge,particularly in the form of EEPROM and flash EEPROM packaged as a smallform factor card, has recently become the storage of choice in a varietyof mobile and handheld devices, notably information appliances andconsumer electronics products. Unlike RAM (random access memory) that isalso solid-state memory, flash memory is non-volatile and retains itsstored data even after power is turned off. In spite of the higher cost,flash memory is increasingly being used in mass storage applications.Conventional mass storage, based on rotating magnetic medium such ashard drives and floppy disks, is unsuitable for the mobile and handheldenvironment. This is because disk drives tend to be bulky, are prone tomechanical failure and have high latency and high power requirements.These undesirable attributes make disk-based storage impractical in mostmobile and portable applications. On the other hand, flash memory, bothembedded and in the form of a removable card is ideally suited in themobile and handheld environment because of its small size, low powerconsumption, high speed and high reliability features.

EEPROM and electrically programmable read-only memory (EPROM) arenon-volatile memory that can be erased and have new data written or“programmed” into their memory cells. Both utilize a floating(unconnected) conductive gate, in a field effect transistor structure,positioned over a channel region in a semiconductor substrate, betweensource and drain regions. A control gate is then provided over thefloating gate. The threshold voltage characteristic of the transistor iscontrolled by the amount of charge that is retained on the floatinggate. That is, for a given level of charge on the floating gate, thereis a corresponding voltage (threshold) that must be applied to thecontrol gate before the transistor is turned “on” to permit conductionbetween its source and drain regions.

The floating gate can hold a range of charges and therefore can beprogrammed to any threshold voltage level within a threshold voltagewindow. The size of the threshold voltage window is delimited by theminimum and maximum threshold levels of the device, which in turncorrespond to the range of the charges that can be programmed onto thefloating gate. The threshold window generally depends on the memorydevice's characteristics, operating conditions and history. Eachdistinct, resolvable threshold voltage level range within the windowmay, in principle, be used to designate a definite memory state of thecell.

In the usual two-state EEPROM cell, at least one current breakpointlevel is established so as to partition the conduction window into tworegions. When a cell is read by applying predetermined, fixed voltages,its source/drain current is resolved into a memory state by comparingwith the breakpoint level (or reference current I_(REF)). If the currentread is higher than that of the breakpoint level, the cell is determinedto be in one logical state (e.g., a “zero” state). On the other hand, ifthe current is less than that of the breakpoint level, the cell isdetermined to be in the other logical state (e.g., a “one” state). Thus,such a two-state cell stores one bit of digital information. A referencecurrent source, which may be externally programmable, is often providedas part of a memory system to generate the breakpoint level current.

In order to increase memory capacity, flash EEPROM devices are beingfabricated with higher and higher density as the state of thesemiconductor technology advances. Another method for increasing storagecapacity is to have each memory cell store more than two states.

For a multi-state or multi-level EEPROM memory cell, the conductionwindow is partitioned into more than two regions by more than onebreakpoint such that each cell is capable of storing more than one bitof data. The information that a given EEPROM array can store is thusincreased with the number of states that each cell can store. EEPROM orflash EEPROM with multi-state or multi-level memory cells have beendescribed in U.S. Pat. No. 5,172,338.

The transistor serving as a memory cell is typically programmed to a“programmed” state by one of two mechanisms. In “hot electroninjection,” a high voltage applied to the drain accelerates electronsacross the substrate channel region. At the same time a high voltageapplied to the control gate pulls the hot electrons through a thin gatedielectric onto the floating gate. In “tunneling injection,” a highvoltage is applied to the control gate relative to the substrate. Inthis way, electrons are pulled from the substrate to the interveningfloating gate.

The memory device may be erased by a number of mechanisms. For EPROM,the memory is bulk erasable by removing the charge from the floatinggate by ultraviolet radiation. For EEPROM, a memory cell is electricallyerasable, by applying a high voltage to the substrate relative to thecontrol gate so as to induce electrons in the floating gate to tunnelthrough a thin oxide to the substrate channel region (i.e.,Fowler-Nordheim tunneling.) Typically, the EEPROM is erasable byte bybyte. For flash EEPROM, the memory is electrically erasable either allat once or one or more blocks at a time, where a block may consist of512 bytes or more of memory.

The memory devices typically comprise one or more memory chips that maybe mounted on a card. Each memory chip comprises an array of memorycells supported by peripheral circuits such as decoders and erase, writeand read circuits. The more sophisticated memory devices operate with anexternal memory controller that performs intelligent and higher levelmemory operations and interfacing.

When a cell is programmed to a given state, it is subject to successiveprogramming voltage pulses, each time adding incremental charge to thefloating gate. In between pulses, the cell is read back or verified todetermine its source-drain current relative to the breakpoint level.Programming stops when the current state has been verified to reach thedesired state. The programming pulse train used may have increasingperiod or amplitude in order to counteract the accumulating electronsprogrammed into the charge storage unit of the memory cell. Programmingcircuits generally apply a series of programming pulses to a selectedword line. In this way, a page of memory cells whose control gates areconnected to the word line can be programmed together.

To achieve good programming performance, the initial programming voltageV_(PGM0) and the step size must be optimally chosen. If the initialprogramming voltage V_(PGM0) is chosen too low, it may require anexcessive number of programming pulses to reach the target state. On theother hand if V_(PGM0) is chosen too high, especially in a multi-statememory, the programming may overshoot the target state in the firstpulse. An optimum initial programming voltage V_(PGM0) would reach thetarget state in a few steps. The optimum V_(PGM0) is fairly sensitive tomanufacturing variations and is traditionally determined by testing atthe factory. This is a process known as V_(PGM0) trimming.

Conventionally, before shipping from the factory, a dedicated memorytester is setup to test a number of memory chips in parallel. One of thetests is to determine optimum initial programming voltages (V_(PGM0)trimmings.) Conventional V_(PGM0) trimmings are therefore performed bymemory testers that are expensive dedicated machines. Moreover, theytend to test each word line in a piece-meal manner, moving to the nextword line after the testing on the current one has been completed. Inthis manner, a page of memory cells on a word line is programmed in aprogram loop to test if it is programmable to a target pattern (e.g.,“0000 . . . 0” where “0” denote a given programmed state). The programloop typically uses a series of programming voltage pulses from a firststarting programming voltage. The page is then read back in a verifyoperation to determine if it has been properly programmed to a targetpattern. If not program-verified, the page/word line of cells is erasedand reprogrammed again in the next program loop with an incrementedstarting programming voltage. This process is repeated until the page isprogram-verified. In this way, the determination can be made of thevalue of the starting programming voltage that enables the page to beprogram-verified.

A number of trials in terms of program loops with increasing initialprogramming voltages may be needed to obtain the one that enables thepage to be programmed properly. It can be seen that in conventionalV_(PGM0) trimmings, the page must be erased before the next program loopis performed using an incremented starting voltage. Thus, the word lineof memory cells carrying the page could be erased multiple times duringthese trials. Furthermore, all other word lines in the same erase blockare also erase-cycled.

Non-volatile memory device has a limited life usage due to theendurance-related stress suffered each time the device goes through anerase/program cycle. For example, the endurance of a Flash EEPROM deviceis its ability to withstand a given number of program/erase cycles. Thephysical phenomenon limiting the endurance of non-volatile memorydevices is the trapping of electrons in the active dielectric films ofthe device. Referring to FIG. 2, during programming, electrons areinjected from the substrate to the charge storage unit through adielectric interface. Similarly, during erasing, electrons are extractedfrom the charge storage unit through a dielectric interface. In bothcases, some of the electrons are trapped by the dielectric interface.The trapped electrons oppose the applied electric field in subsequentprogram/erase cycles thereby causing the programmed threshold voltage toshift to a lower value and the erased threshold voltage to shift to ahigher value. This can be seen in a gradual closure in the thresholdwindow. The threshold window closure is what limits the practicalendurance to approximately 10⁴ program/erase cycles.

In a memory architecture where there are many word lines in each block,erasing a word line of cells multiple times would entail erasing therest of the word lines in the same block the same number of times. Ifthese other word lines in the block are also being tested, the number oftimes the block is erased would go up geometrically. For example, if ittakes roughly 10 trials for each word line, and there are 64 word linesin each block, it will mean the block will suffer erase cycling of atotal of 640 times. Furthermore, V_(PGM0) trimming is also performed tocover a number of other variables. For example, the word line may carrymultiple physical pages as well as multiple logical pages. The wordlines near the block boundary may have slightly different programmingcharacteristics compared to the ones in the core region. Thesevariations could contribute another factor of 10 to the number oftrimmings needed. Thus, conventional V_(PGM) trimmings at the factorycould consume as much as several thousand endurance cycles of a memorydevice. As much as half of a memory device's life usage could be used upbefore it gets to a customer.

Therefore there is a general need for high performance and high capacitynon-volatile memory. In particular, there is a need for a non-volatilememory with optimally set starting programming voltages, yet without theexpense of excessively endurance cycling the memory to determine them.

SUMMARY OF INVENTION

Estimation of a Starting Voltage by Scaling

According to another aspect of the invention, the initial value of astarting programming voltage is estimated by an initial programming testrun of the page on a word line. A selected page on a word line issuccessively programmed by a series of voltage pulses of a staircasewaveform with verifications in between the pulses until the page isverified to have been programmed to a designated pattern. The finalprogramming voltage at the time the page is program-verified will beused to estimate a starting programming voltage by scaling back apredetermined amount. An average starting programming voltage isobtained by considering a sample of similar page/word lines. Anyunprogrammable page/word lines in the sample can be ignored so as not toskew the statistics with atypical entries.

In another embodiment, the process is further refined in which theestimated starting programming voltage from a first pass is used as theinitial value of the staircase waveform in a second pass. In this way,when averaging over a sample of similar pages, the starting programmingvoltage for a representative page can be estimated. The startingprogramming voltage is estimated by offsetting the final programmingvoltage negatively by a predetermined number of steps of the staircasewaveform. The predetermined number of steps is preferably similar to thenumber of steps budgeted for program success in a normal programoperation.

One advantage of this scaling scheme is that a simple one- or two-passprogramming test on each page/word line is sufficient to yield anestimate for the starting programming voltage for the page. Each pagecan be tested independently and does not involve multiple eraseoperation during the test. Therefore there is no need for management ofblock erase among a sample of word lines.

V_(PGM) Trimming Weighted Toward Faster Programming Pages

According to another aspect of the invention, in a memory array havingmultiple erasable blocks, each block having a group of word lines withsimilar type of programming characteristics, a scheme for obtaining anoptimum starting programming voltage of a representative page of thegroup includes: forming samples over a set of blocks with a word linefrom a geometrically similar location of each block of the set,obtaining a statistic estimation of a programming voltage from eachsample of the set, and selecting a minimum estimation among the set toderive the optimum starting programming voltage. In this way, theoptimum value is weighted towards the faster programming word lines forthat group since they require a lower programming voltage compared tothe slower ones.

The scheme of testing individual samples formed by selecting at least asimilar page from each block also has the advantage of minimum storagerequirement. After each sample is tested, a test result in the form ofan average is obtained and stored. Then the next sample is tested insimilar manner and its average is then compared to the first one instorage. Whichever average is the lower one will be retained in storageso that only one data need be stored as the set of samples is processeda sample at a time.

The programming voltage trimming schemes described in other sectionsexamine a page at a time as to whether all bits in the page areprogram-verified or not. This implies the test results are catering tothe slower programming bits, as these slower bits must also beprogram-verified before the whole page is deemed program-verified. Theconsequence is that the starting voltage may be over estimated for thefaster programming bits with the danger of over-programming. The presentsampling and statistical computational scheme allows a lowest value tobe selected for the set of starting voltages that was derived from ascheme biased towards the slower programming bits.

Also, with the sample formed by selecting a relatively small portionfrom each of the blocks, another advantage is that the sample average isnot as sensitive to the presence of any bad blocks where a large portionof the word lines in it may be defective.

Additional features and advantages of the present invention will beunderstood from the following description of its preferred embodiments,which description should be taken in conjunction with the accompanyingdrawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates schematically the functional blocks of a non-volatilememory chip.

FIG. 2 illustrates schematically a non-volatile memory cell.

FIG. 3 illustrates the relation between the source-drain current I_(D)and the control gate voltage V_(CG) for four different charges Q1-Q4that the floating gate may be selectively storing at any one time.

FIG. 4 illustrates an example of an NOR array of memory cells.

FIG. 5A illustrates schematically a string of memory cells organizedinto an NAND string.

FIG. 5B illustrates an example of an NAND array of memory cells,constituted from NAND strings such as that shown in FIG. 5A.

FIG. 6 illustrates schematically, an example of a memory array organizedin erasable blocks.

FIG. 7 illustrates a series of programming voltage pulses in the form ofa staircase waveform being applied to a selected word line.

FIG. 8 illustrates a typical testing setup to determine optimum initialprogramming voltages for a number of memory chips.

FIG. 9 illustrates schematically the function blocks of the memorytester testing one of the memory chips shown in FIG. 8 for determinationof optimum initial programming voltages.

FIG. 10 illustrates the function blocks of an alternate memory testeroperating with one of the memory chips shown in FIG. 8 for determinationof optimum initial programming voltages, according to a preferredembodiment.

FIG. 11A is a flow diagram illustrating a general scheme for obtainingan estimated starting programming voltage for a given type of word linesin a memory device.

FIG. 11B illustrates in more detail one embodiment of selecting a goodblock shown in FIG. 11A.

FIG. 12 is a flow diagram illustrating a conventional implementation ofthe steps of determining an initial programming voltage of a page on aword line.

FIG. 13 is a flow diagram illustrating generally an operation forestimating an optimum starting programming voltage from a sample of wordlines within a block, according to a preferred embodiment of theinvention.

FIG. 14 is a flow diagram illustrating a specific implementation of theoperation shown in FIG. 13.

FIG. 15 illustrates the staircase waveform used in the initialprogramming test of a page of memory cells.

FIG. 16 is a flow diagram illustrating the determination of startingprogramming voltage for V_(PGM) trimming, using the staircase waveformscan shown in FIG. 15.

FIG. 17 is a flow diagram illustrating a multiple pass determination ofstarting programming voltage for a sample of pages/word lines.

FIG. 18 is a flow diagram illustrating the scheme of obtaining a V_(PGM)trimmed value that is weighted toward the faster programming word lines.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Memory System

FIG. 1 to FIG. 7 illustrate example memory systems in which the variousaspects of the present invention may be implemented.

FIG. 1 illustrates schematically the functional blocks of a non-volatilememory chip. The memory chip 100 includes a two-dimensional array ofmemory cells 200, control circuitry 210, and peripheral circuits such asdecoders, read/write circuits and multiplexers. The memory array 200 isaddressable by word lines (see FIG. 2) via row decoders 230A and 230Band by bit lines (see FIG. 2) via column decoders 260A and 260B. Theread/write circuits 270A and 270B allow a page of memory cells to beread or programmed in parallel. In a preferred embodiment, a page isconstituted from a contiguous row of memory cells sharing the same wordline. In another embodiment, where a row of memory cells are partitionedinto multiple pages, block multiplexers 250A and 250B are provided tomultiplex the read/write circuits 270A and 270B to the individual pages.

The control circuitry 210 cooperates with the read/write circuits 270 toperform memory operations on the memory array 200. The control circuitry210 typically includes a state machine 212 and other circuits such as anon-chip address decoder and a power control module (not shownexplicitly). The state machine 212 provides chip level control of memoryoperations.

The memory array 200 is typically organized as a two-dimensional arrayof memory cells arranged in rows and columns and addressable by wordlines and bit lines. The array can be formed according to an NOR type oran NAND type architecture.

FIG. 2 illustrates schematically a non-volatile memory cell. The memorycell 10 can be implemented by a field-effect transistor having a chargestorage unit 20, such as a floating gate or a dielectric layer. Thememory cell 10 also includes a source 14, a drain 16, and a control gate30.

There are many commercially successful non-volatile solid-state memorydevices being used today. These memory devices may employ differenttypes of memory cells, each type having one or more charge storageelement.

Typical non-volatile memory cells include EEPROM and flash EEPROM.Examples of EEPROM cells and methods of manufacturing them are given inU.S. Pat. No. 5,595,924. Examples of flash EEPROM cells, their uses inmemory systems and methods of manufacturing them are given in U.S. Pat.Nos. 5,070,032, 5,095,344, 5,315,541, 5,343,063, 5,661,053, 5,313,421and 6,222,762. In particular, examples of memory devices with NAND cellstructures are described in U.S. Pat. Nos. 5,570,315, 5,903,495,6,046,935. Also, examples of memory devices utilizing dielectric storageelement have been described by Eitan et al., “NROM: A Novel LocalizedTrapping, 2-Bit Nonvolatile Memory Cell,” IEEE Electron Device Letters,vol. 21, no. 11, November 2000, pp. 543-545, and in U.S. Pat. Nos.5,768,192 and 6,011,725.

In practice, the memory state of a cell is usually read by sensing theconduction current across the source and drain electrodes of the cellwhen a reference voltage is applied to the control gate. Thus, for eachgiven charge on the floating gate of a cell, a corresponding conductioncurrent with respect to a fixed reference control gate voltage may bedetected. Similarly, the range of charge programmable onto the floatinggate defines a corresponding threshold voltage window or a correspondingconduction current window.

Alternatively, instead of detecting the conduction current among apartitioned current window, it is possible to set the threshold voltagefor a given memory state under test at the control gate and detect ifthe conduction current is lower or higher than a threshold current. Inone implementation the detection of the conduction current relative to athreshold current is accomplished by examining the rate the conductioncurrent is discharging through the capacitance of the bit line.

FIG. 3 illustrates the relation between the source-drain current I_(D)and the control gate voltage V_(CG) for four different charges Q1-Q4that the floating gate may be selectively storing at any one time. Thefour solid I_(D) versus V_(CG) curves represent four possible chargelevels that can be programmed on a floating gate of a memory cell,respectively corresponding to four possible memory states. As anexample, the threshold voltage window of a population of cells may rangefrom 0.5V to 3.5V. Six memory states may be demarcated by partitioningthe threshold window into five regions in interval of 0.5V each. Forexample, if a reference current, I_(REF) of 2 μA is used as shown, thenthe cell programmed with Q1 may be considered to be in a memory state“1” since its curve intersects with I_(REF) in the region of thethreshold window demarcated by V_(CG)=0.5V and 1.0V. Similarly, Q4 is ina memory state “5”.

As can be seen from the description above, the more states a memory cellis made to store, the more finely divided is its threshold window. Thiswill require higher precision in programming and reading operations inorder to be able to achieve the required resolution.

FIG. 4 illustrates an example of an NOR array of memory cells. In thememory array 300, each row of memory cells are connected by theirsources 14 and drains 16 in a daisy-chain manner. This design issometimes referred to as a virtual ground design. The cells 10 in a rowhave their control gates 30 connected to a word line, such as word line42. The cells in a column have their sources and drains respectivelyconnected to selected bit lines, such as bit lines 34 and 36.

FIG. 5A illustrates schematically a string of memory cells organizedinto an NAND string. An NAND string 50 comprises of a series of memorytransistors M1, M2, . . . Mn (e.g., n=4, 8, 16 or higher) daisy-chainedby their sources and drains. A pair of select transistors S1, S2controls the memory transistors chain's connection to the external viathe NAND string's source terminal 54 and drain terminal 56 respectively.In a memory array, when the source select transistor S1 is turned on,the source terminal is coupled to a source line (see FIG. 5B).Similarly, when the drain select transistor S2 is turned on, the drainterminal of the NAND string is coupled to a bit line of the memoryarray. Each memory transistor in the chain has a charge storage element20 to store a given amount of charge so as to represent an intendedmemory state. A control gate of each memory transistor provides controlover read and write operations. As will be seen in FIG. 5B, the controlgates of corresponding memory transistors of a row of NAND string areall connected to the same word line. Similarly, a control gate of eachof the select transistors S1, S2 provides control access to the NANDstring via its source terminal 54 and drain terminal 56 respectively.Likewise, the control gates of corresponding select transistors of a rowof NAND string are all connected to the same select line.

When an addressed memory transistor within an NAND string is read or isverified during programming, its control gate is supplied with anappropriate voltage. At the same time, the rest of the non-addressedmemory transistors in the NAND string 50 are fully turned on byapplication of sufficient voltage on their control gates. In this way, aconductive path is effective created from the source of the individualmemory transistor to the source terminal 54 of the NAND string andlikewise for the drain of the individual memory transistor to the drainterminal 56 of the cell. Memory devices with such NAND string structuresare described in U.S. Pat. Nos. 5,570,315, 5,903,495, 6,046,935.

FIG. 5B illustrates an example of an NAND array of memory cells,constituted from NAND strings such as that shown in FIG. 5A. Along eachcolumn of NAND strings, a bit line such as bit line 36 is coupled to thedrain terminal 56 of each NAND string. Along each bank of NAND strings,a source line such as source line 34 is couple to the source terminals54 of each NAND string. Also control gates along a row of cells in abank of NAND strings are connected to a word line. An entire row ofmemory cells in a bank of NAND strings can be addressed by appropriatevoltages on the word lines and select lines of the bank of NAND string.When a memory transistor within a NAND string is being read, theremaining memory transistors in the string are turned on hard via theirassociated word lines so that the current flowing through the string isessentially dependent upon the level of charge stored in the cell beingread.

FIG. 6 illustrates schematically, an example of a memory array organizedin erasable blocks. Programming of charge storage memory devices canonly result in adding more charge to its charge storage elements.Therefore, prior to a program operation, existing charge in chargestorage element of a memory cell must be removed (or erased). Anon-volatile memory such as EEPROM is referred to as a “Flash” EEPROMwhen an entire array of cells, or significant groups of cells of thearray, is electrically erased together (i.e., in a flash). Once erased,the group of cells can then be reprogrammed. The group of cells erasabletogether may consist of one or more addressable erase unit. The eraseunit or block typically stores one or more pages of data, the page beingthe unit of programming and reading, although more than one page may beprogrammed or read in a single operation. Each page typically stores oneor more sectors of data, the size of the sector being defined by thehost system. An example is a sector of 512 bytes of user data, followinga standard established with magnetic disk drives, plus some number ofbytes of overhead information about the user data and/or the block inwith it is stored.

In the example shown in FIG. 6, individual memory cells in the memoryarray 200 are accessible by word lines WL0-WLy and bit lines BL0-BLx.The memory is organized into erase blocks, such as erase blocks 0, 1, .. . m. Referring also to FIGS. 5A and 5B, if the NAND string 50 contains16 memory cells, then the first bank of NAND strings in the array willbe accessible by WL0 to WL15. The erase block 0 is organized to have allthe memory cells of the first bank of NAND strings erased together. Inanother memory architecture, more than one bank of NAND strings may beerased together.

FIG. 7 illustrates a series of programming voltage pulses in the form ofa staircase waveform being applied to a selected word line. When a cellis programmed to a given state, it is subject to successive programmingvoltage pulses, each time attempting to add incremental charge to thefloating gate. In between pulses, the cell is read back or verified todetermine its source-drain current relative to the breakpoint level.Programming stops when the current state has been verified to reach thedesired state. The programming pulse train used may have increasingperiod or amplitude in order to counteract the accumulating electronsprogrammed into the charge storage unit of the memory cell. Programmingcircuits generally apply a series of programming pulses to a selectedword line. In this way, a page of memory cells whose control gates areconnected to the word line can be programmed together.

Memory Testing System

FIG. 8 to FIG. 10 illustrate example memory testing systems in which thevarious aspects of the present invention may be implemented.

To achieve good programming performance, the initial programming voltageV_(PGM0) and the step size must be optimally chosen. If the initialprogramming voltage V_(PGM0) is chosen too low, it may require anexcessive number of programming pulses to reach the target state. On theother hand if V_(PGM0) is chosen too high, especially in a multi-statememory, the programming may overshoot the target state in the firstpulse. Similar considerations apply to the step size from one pulse tothe next. Generally, an optimum step size will allow adequate resolutionto transverse each partitioned or demarcated region shown in FIG. 3 in afew steps. An optimum initial programming voltage V_(PGM0) would reachthe target state in a few steps. Generally, the step size can bepredetermined based on the number of partitions in the threshold window.The optimum V_(PGM0) is fairly sensitive to manufacturing variations andis traditionally determined by testing at the factory. This is a processknown as V_(PGM0) trimming.

FIG. 8 illustrates a typical testing setup to determine optimum initialprogramming voltages for a number of memory chips. A memory tester 300typically connects to a large number of memory chips 100 for paralleltesting. Typically, before shipping from the factory, a dedicated memorytester is setup to test a number of memory chips in parallel. One of thetests is to determine optimum initial programming voltages (V_(PGM0)trimmings.)

FIG. 9 illustrates schematically the functional blocks of the memorytester testing one of the memory chips shown in FIG. 8 for determinationof optimum initial programming voltages. Essentially, the memory tester300 issues a series of commands to the memory chip 100 for it to performa number of program operations using different samples of initialprogramming voltage. The non-volatile memory array 200 has a reservedarea (“ROMFUSE”) 202 for storing system data. The memory testerinteracts with the on-chip memory controller 210 via a memory interface310. The tester has a processor 302 that executes a test program in RAM304 that was initially retrieved from ROM 308. The test programexecution is facilitated by a set of tester registers 306. The testprogram is controlled by a user through inputs from a user interface312. Based on the test results, optimum initial programming voltagesV_(PGM0) are determined for various programming variations, such asdifferent type of word lines and pages. These trimmed values are thenstored back into the ROMFUSE 202. During normal use of the memory, thedata in the ROMFUSE is loaded into the controller registers 350 onpower-up so that the controller 210 has ready access to them duringmemory operations.

FIG. 10 illustrates the function blocks of an alternate memory testeroperating with one of the memory chips shown in FIG. 8 for determinationof optimum initial programming voltages, according to a preferredembodiment. In this implementation, much of the testing functionalitiesare built into the memory chip 100 itself. The on-chip memory controller210′ is further enhanced with an embedded Built-in Self Test (“BIST”)module 340 and additional capacity for a set of controller registers350. In this way, various tests including the V_(PGM) trimmingoperations described may be performed on-chip. Based on the testresults, an optimum initial programming voltage V_(PGM0) can bedetermined either on-chip or by the external tester 330. This determinedvalue is stored back into the ROMFUSE 202. During normal use of thememory, on power-up, the data in the ROMFUSE is loaded into thecontroller registers 350 on power-up so that the controller 210′ hasready access to them during memory operations.

With the enhanced, self-testing on-chip controller 210′, an externaldedicated tester may no longer be required. A simple tester 310,implemented by a personal computer, will suffice for operating a largenumber of memory chips when they are being tested in parallel. Thememory tester 310 interacts with the on-chip memory controller 210′ viaa tester memory interface 332. It receives operator inputs from a userinterface 334. In one implementation, the tester 310 simply instructseach of the memory chips 100 to execute a self test and reports thestatus at the end of the test for each memory chip. In anotherimplementation, the tester 310 gathers the statistics from the testresults and makes statistical computations.

The self-testing on-chip controller 210′ has the advantage of doing awaywith an expensive dedicated tester. Furthermore, it allows thepossibility of testing in the field, so that as the memory device ages,its V_(PGM0) values could be re-trimmed.

V_(PGM) Trimming Operations

FIG. 11A is a flow diagram illustrating a general scheme for obtainingan estimated starting programming voltage for a given type of word linesin a memory device. As mentioned before, this process is also referredto as programming voltage (“V_(PGM)”) trimming.

STEP 400: Selecting a Good Block i. In some implementation, it ispreferable to perform a quick programmability test on a block beforesubjecting it to a more time consuming V_(PGM) trimming operation.Depending on implementation, this step is optional. It may be omitted bysimply ignoring any defective word lines encountered. A more detaileddescription of determining a good block is shown in FIG. 11B.STEP 410: Selecting a group of word lines in the selected block i forsampling; {WL(i, j) where j=0, m−1}. Generally, the group of word linesselected and the type of word lines it seeks to represent share similarprogramming characteristics.STEP 420: Determining an initial programming voltage V_(PGM0)(i, j) forthe page on WL(i, j) such that a staircase pulsing voltage waveformstarting from V_(PGM0)(i, j) will program the whole page to a designatedstate within a predetermined number of pulses. A page of memory cellssharing the word line WL(i, j) is programmed in parallel. The staircasewaveform increases by a step with every pulse and is budgeted toincrease up to the predetermined number of pulses.STEP 460: Selecting more blocks if desired to gather enough of a sampleby repeating STEPs 400-420. For example, each block may contain threetypes of word lines having different programming characteristics. Thefirst type comprises the first two word lines at the top boundary of theblock. The second type comprises the last two word lines at the bottomboundary of the block. The third type comprises the bulk of the wordlines in the core region of the block. To get a better sample for anyone of these three types of word lines, a bigger sample is preferablytaken, involving more blocks distributed across the memory array. Asdescribed in a later section, different samples of a similar type ofword lines may also be formed by taking geometrically similarly locatedword lines from a set of blocks.STEP 470: Computing an average starting programming voltage(“<V_(PGM0)>”) for the entire sample of word lines. This is obtained bydividing the aggregate of V_(PGM0) for each sampled word line by theaggregate of all sampled word lines, viz.:

${\text{<}{V_{{PGM}\; 0}\left( {i,j} \right)}\text{>}} = {\sum\limits_{i,j}^{\;}{{V_{{PGM}\; 0}\left( {i,j} \right)}/\sum\limits_{i,j}^{\;}}}$

FIG. 11B illustrates in more detail one embodiment of selecting a goodblock shown in FIG. 11A. A good block is meant to be a block where allits pages of memory cells along the word lines are programmable. Thus,STEP 400 shown in FIG. 11A is further articulated as follows:

STEP 401: Erasing the block.

STEP 402: Programming in turn all the word lines in the block to adesignated state using a predetermined number of pulses.

STEP 404: Is any word lines in the block fail to program successfully?If there is any failed ones, proceeding to STEP 406, otherwiseproceeding to STEP 408.

STEP 406: The block is considered bad since it contains at least onedefective word line. This is especially true of memory with NANDarchitecture, where a bad cell within a NAND chain usually renders thewhole chain inoperable. The bad block will not be selected for V_(PGM)trimming.STEP 408: The block is good. The good block will be selected for V_(PGM)trimming.STEP 409: Erasing the block to that the word line in it are ready to beprogrammed.

In other implementations, where the existence of one or more defectiveword line does not necessary render the whole block defective, there isno need to perform a bad block search. In that case as described before,if a defective word line is encountered during test, it is simplyignored.

FIG. 12 is a flow diagram illustrating a conventional implementation ofthe steps of determining an initial programming voltage of a page on aword line. In a conventional implementation of STEP 420 of FIG. 11A, thesampled word lines in a block are tested in a piece-meal manner forexpediency and efficient use of storage. The next word line will betested after the test on the previous one has completed. Thus, after theprevious word line has been tested to program successfully (ordetermined to be unprogrammable) will the test be repeated on the nextword line. In the convention case, STEP 420 shown in FIG. 11A will befurther articulated as follows:

STEP 422: Erasing the block i so that the word lines in it can beprogrammed.

STEP 424: Initially, point to the first word line of the sample bysetting j=0.

STEP 426: Using the “j” index to select the word line WL(i, j) from thesample in the block.

STEP 428: Setting the initial values of the starting programmingvoltages: V_(PGM0)(i, j)=V_(PGM0) _(—) 0.

Step 430: Programming a page on the word line to a designated stateusing a predetermined number of pulses starting from V_(PGM0)(i, j).

STEP 432: Is page/WL programmed? If WL(i,j) is not programmed to thedesignated state, proceeding to STEP 440, otherwise proceeding to STEP450.

STEP 440: incrementing V_(PGM0)(i, j) such that V_(PGM0)(i,j)=V_(PGM0)(i, j)+ΔV.

STEP 442: Erasing the block i to allow the word line to be reprogrammedwith the incremented V_(PGM0)(i, j).

STEP 450: The page has been programmed successfully. Collectingstatistics by saving V_(PGM0)(i, j).

STEP 452: Erasing the block i to allow the next word line to beprogrammed.

STEP 454: Is the last word line in the sample reached? If the last wordline has not been tested, proceeding to STEP 456, otherwise proceedingto STEP 460 in FIG. 10A.

STEP 456: Moving to the next word line with j=j+1, and returning to STEP424 to test the next word line.

It will be seen that in this conventional scheme, a page is repeatedcycled through a succession of program loops with erases in between. Asdescribed earlier, testing the word lines in a piece-meal manner willsubject the block to many more erasures, since for each word line everyprogram loop around STEP 440 and STEP 442 will incur a block erasure.This expense is compounded on every word line under test.

Referring to FIG. 6 again, in a memory architecture where there are manyword lines in each block, erasing a word line of cells multiple timeswould entail erasing the rest of the word lines in the same block thesame number of times. As mentioned earlier, if these other word lines inthe block are also being tested, the number of times the block is erasedwould go up geometrically. As much as half of a memory device's lifeusage could be used up before it gets to a customer.

V_(PGM) Trimming with Reduced Erase Cycling

According to one aspect of the invention, in a non-volatile memoryhaving an array of memory cells that are organized into blocks, eachblock being a block of word lines for accessing memory cells that areerasable together, and each word line containing at least one page ofmemory cells that are programmable together, an optimum starting voltagefor programming a page of memory cells on a word line in a block isestimated by test programming a sample of similar word lines in theblock to obtain a statistical average of individual starting voltagesthat enable each associated page/word line to be programmable to adesignated pattern. This is accomplished by a subjecting all the pagesof the sample to a program loop where a series of pulses from a startingprogramming voltage is applied. After each pages of the sample has beenthrough the program loop, the page/word line that has beenprogram-verified is removed from further processing and its associatedstarting programming voltage is saved. The block is then erased so thatthe not yet verified word lines in the sample can be reprogrammedsubject to the next program loop the next incremented starting voltage.The cycling continues until all word lines in the sample have beenprogram-verified. A statistical average can then be obtained from theindividual starting programming voltages to derive an optimum startingprogramming voltage for the page.

Testing the sample of word lines in a block by the scheme described hasthe advantage of reducing the number of block erasures. The sample ofword line are tested in phase with each other, so that when all the wordlines are done programming in each program loop, they are then erasedtogether to be ready for the next program loop. This scheme results inreducing the number of block erasure and can result in a saving of oneorder of magnitude compared to a conventional scheme. For example, theconvention scheme shown in FIG. 11 has each word line testedindependently with block erasure before every program loop withoutsynchronization with each other. The block erasure associated with everyprogram loop for one word line is then compound for every word line inthe sample.

FIG. 13 is a flow diagram illustrating generally an operation forestimating an optimum starting programming voltage from a sample of wordlines within a block, according to a preferred embodiment of theinvention. The operation is illustrated to have three phases. The firstphase 500 is for testing and collecting statistics of a sample ofpages/word lines within a block. It includes STEP 510 to STEP 550. Eachword line may support one or more physical page of memory cells. Inaddition, each page of memory cells may store one or more logical pagesof data depending how many bits each memory cell can store. Thus,multiple logical pages may be associated with a given word line. Insofaras there are any significant variations in programming characteristicsin programming the various logical pages, the programming of eachlogical page may have its own V_(PGM) trimming on the same word line. Atany one time the testing is directed to the programming of a givenlogical page on a given word line. For expediency, the terminologyrefers to testing a page or a word line interchangeably. The secondphase, including STEP 560, is to repeat the first phase 500 on otherblocks to be sampled. The first two phases can take place concurrentlyif the decoding and programming circuits support operating on more thanone block. The third phase, including STEP 570 to STEP 572, is tocompute a statistical average in order to derive an estimated optimumstarting programming voltage for the type of word line under test.

The present operation essentially cycles through the word lines in thesample by applying a programming step to each word line with anassociated starting voltage and then verifying to determine if the pageon the word line is programmed to a designated state within a specifiedprogram loop target. If any page/word line is program-verified, thestarting voltage associated with it is saved. If the page/word line isnot yet program-verified, the starting voltage associated with it isincremented. The increment information is also saved, preferably into anaccumulator. The cycling through the word lines is repeated on the onesthat have not been program-verified so that after a block erasure, theyare subject to another programming step with associated incrementedstarting voltages. This process continues until all the word lines inthe sample have been program-verified within the specified program looptarget.

STEP 510: Selecting a sample of pages representative of a given type ofpage within a block.

STEP 520: Providing an initial value to a starting programming voltageassociated with each of the pages in the sample.

STEP 530: Erasing the block containing the sample of pages.

STEP 540: Programming sequentially a subset of pages among the sample ofpages not yet programmed to a target pattern, each page of the subsetbeing programmed with the associated starting programming voltage,wherein after programming of each page:

verifying if the target pattern has been programmed thereto; and

incrementing the associated starting programming voltage by apredetermined amount when the page has not been program-verified,otherwise, saving information for deriving the associated startingprogramming voltage that enables the page to be program-verified.

STEP 550: Are all pages of the sample program-verified? If the pages arenot all verified, returning to STEP 530, otherwise proceeding to STEP560.

STEP 560: Repeating STEP 500 to STEP 560 for other blocks selected toinclude in the sample.

STEP 570: Computing an average starting programming voltage for thesample from the associated saved information.

STEP 572: Deriving a starting programming voltage for the given type ofpage based on the average starting programming voltage of the sample.

The specified program loop target is a limit for the maximum number ofincrements allowed. This limit has two different implications whenimplemented in two different manners.

In one embodiment, the limit sets a relatively low increment ceiling. Itsets the maximum number of programming pulses or increments from thegiven starting voltage before programming of the page is deemedunsuccessful or insufficient. This number is set to be similar to thenumber of programming steps budgeted during an actual program operationin a normal use of the memory device. For example, in a normal programoperation by the user, the programming for a particular logical page isrequired to be completed within eight to ten programming pulses. In thisway, the V_(PGM) trimming test closely duplicates real programmingconditions. In general this limit ranges from five to fifteen.

In another embodiment to be described in more detail later, theprogramming voltage is allowed to increment until a final voltageresults in a programmed page. The final voltage is then used to estimatean optimum starting voltage by scaling back a predetermined number ofsteps. In this embodiment, there is no limit set to emulate normalprogramming conditions. However, the increment of the startingprogramming voltage is not boundless in case a defective word line isencountered. Thus, the limit is set to a relative high (e.g., thirty tofifty) number to limit the increments to a maximum predetermined valuein case a defective word line is encountered. When a page fails to beprogrammed to the designated state after the starting programmingvoltage has been incremented to the maximum value, the word line isdeemed defective and its V_(PGM) data will be excluded from thestatistics. In another implementation, the whole block containing thedefective word line may be excluded.

Thus, the two embodiments described impose a limit on the program loopfor different reasons. The first with a lower limit measures programmingsuccess from a starting voltage by providing a margin of a number ofpulsing steps as in a normal program operation. Programming is deemedsuccessful if completed within the limit. Conversely, unsuccessfulprogramming implies that the starting voltage is set too low. The secondembodiment with the limit set to a high ceiling is to prevent boundlessincrements in case a defective word line can never by programmed. Thus,when this limit is reached, it does not mean the starting voltage is toolow, but the word line is simply defective.

In yet another implementation, a lower limit is also contemplated. Ifthe program loop is completed within the first few (e.g., one or two)steps of the staircase waveform, it will mean that the page has a veryfast programming characteristics, which is not typical. Thus, in thecase when a page is program-verified within a predetermined lower limit,it is deemed atypical and will also be excluded from the averaging so asnot to skew the statistics.

FIG. 14 is a flow diagram illustrating a specific implementation of theoperation shown in FIG. 13.

step 610: Setting initial values for block i:

Page verify status: PageDone(j)=FALSE for all j

Initial programming voltage: V_(PGM0)(i, j)=V_(PGM0) _(—) 0 for all j

# of DVPGM0: StepUp#(j)=0 for all j.

STEP 620: Erasing the block i.

STEP 630: j=0.

STEP 632: Selecting word line WL(i, j) among a sample: j=0, m−1

STEP 640: Programming a page on the word line to a designated stateusing up to a predetermined number of pulses starting from V_(PGM0)(i,j).

STEP 642: Is page programmed? If the page is not program-verified,proceeding to STEP 650, otherwise proceeding to STEP 660.

STEP 650: The word line is not yet program-verified. So its associatedinitial programming voltage will be incremented by an additional step.Incrementing StepUp#(j): StepUp#(j)=StepUp#(j)+1.

STEP 652: Incrementing V_(PGM0)(i, j): V_(PGM0)(i, j)=V_(PGM0)(i,j)+StepUp#(j)*ΔV

STEP 660: Testing of the word line is done and marking Page done:PageDone=TRUE.

STEP 662: The information for the final programming voltage isaccumulated as the number of stepups from the initial voltage.StepUp#Global=StepUp#Global+Stepup#(j).

STEP 670: Next word line: j=j+1.

STEP 672: Last word line in the sample reached? (i.e. j=m?) If WL(i, j)is not the last word line, proceeding to STEP 680, otherwise proceedingto STEP 690.

STEP 680: Not processing done page: Is PageDone(j)=TRUE? If the statusindicates the current page is already program-verified, it will beignored or skipped with the process proceeding to STEP 670, otherwisethe process returning to STEP 632 to testing the next word line that hasnot yet been program-verified.STEP 690: Rescanning remaining not-done WLs until all pages/WLs areprogrammed: IsPageDone(j)=TRUE for all j? If at least one word line isnot program-verified, returning to STEP 620 to reprogram it with theincremented programming voltage, otherwise programming of all word linesare done and the process will proceed to STEP 560 in FIG. 12.Estimation of a Starting Voltage by Scaling

According to another aspect of the invention, the initial value of astarting programming voltage is estimated by an initial programming testrun of the page on a word line. A selected page on a word line issuccessively programmed by a series of voltage pulses of a staircasewaveform with verifications in between the pulses until the page isverified to have been programmed to a designated pattern. The finalprogramming voltage at the time the page is program-verified will beused to estimate a starting programming voltage by scaling back apredetermined amount. An average starting programming voltage isobtained by considering a sample of similar page/word lines. Anyunprogrammable page/word lines in the sample can be ignored so as not toskew the statistics with atypical entries.

In another embodiment, the process is further refined in which theestimated starting programming voltage from a first pass is used as theinitial value of the staircase waveform in a second pass. In this way,when averaging over a sample of similar pages, the starting programmingvoltage for a representative page can be estimated. The startingprogramming voltage is estimated by offsetting the final programmingvoltage negatively by a predetermined number of steps of the staircasewaveform. The predetermined number of steps is preferably similar to thenumber of steps budgeted for program success in a normal programoperation.

One advantage of this scaling scheme is that a simple one- or two-passprogramming test on each page/word line is sufficient to yield anestimate for the starting programming voltage for the page. Each pagecan be tested independently and does not involve multiple eraseoperation during the test. Therefore there is no need for management ofblock erase among a sample of word lines.

FIG. 15 illustrates the staircase waveform used in the initialprogramming test of a page of memory cells. The staircase waveformvoltage is applied to the word line supporting the page of memory cells.Initially a voltage pulse at Vi is applied to perform an incrementalprogramming. This is followed by the voltage changing to VVER suitablefor reading the page to verify if the page has been programmed to adesignated pattern. The process of program pulsing and verifyingcontinues until the page is program-verified. At that point, theprogramming voltage has been incremented to Vf=StepUps#*ΔV. In oneembodiment, this final voltage is backed off a predetermined number ofsteps to serve as an estimate for the starting programming voltage forthe VPGM trimming tests described earlier, viz.:V_(PGM0)=Vf−N_(OFFSET)*ΔV, where NOFFSET is the predetermined number ofsteps.

FIG. 16 is a flow diagram illustrating the determination of startingprogramming voltage for a given page, using the staircase waveform scanshown in FIG. 15.

STEP 800: Providing an associated programming voltage for programmingthe page of memory cells, the associated programming voltage having apredetermined initial voltage level Vp=Vi.

STEP 802: Erasing the page of memory cells.

STEP 810: Applying a pulse of Vp to the page of memory cells.

STEP 812: Verifying if the page of memory cells has been programmed to acorresponding page of predetermined memory states.

STEP 814: Is page program-verified? If page is not program-verified,proceeding to STEP 820, otherwise proceeding to STEP 830.

STEP 820: Incrementing the associated programming voltage by apredetermined amount Vp=Vp+DV.

STEP 830: Saving the starting programming voltage for the page,V_(PGM0)=Vp−N_(OFFSET)*ΔV. In a preferred implementation, the estimatedstarting programming voltage is further refined in a second pass testrun where it is used as the initial value of the staircase waveform. Inthis way, the initial value more closely emulates normal programmingoperations as compared to the one used in the first pass test run.

As before, a sample of word lines of similar type are tested to obtain astatistically average starting programming voltage for the type. Inorder to reduce the storage for the test results, a statistical averageis preferably performed after each test run.

FIG. 17 is a flow diagram illustrating a multiple pass determination ofstarting programming voltage for a sample of pages/word lines.

STEP 850: Performing a first pass test run (e.g., STEP 800 to STEP 830for each page) on a sample of pages of similar type.

STEP 860: Obtaining a first statistical average for estimated startingprogramming voltages from the first pass test run: <V_(PGM0)>₁.

STEP 870: Performing a second pass test run (e.g., STEP 800 to STEP 830for each page) on a sample of pages of similar type, using <V_(PGM0)>₁as the initial value for the starting programming voltage (i.e.,Vi=<V_(PGM0)>₁).

STEP 880: Obtaining a second statistical average for estimated startingprogramming voltages from the second pass test run: <V_(PGM0)>₂.

In one embodiment, only one pass (STEP 850 to STEP 860) is sufficient toobtain an acceptable estimation of the starting programming voltage. Inanother embodiment, a second pass (STEP 870 to STEP 880) is optionallyused to refine the result obtained from the first pass.

In another implement, the estimated starting voltage <V_(PGM0)>₁ or<V_(PGM0)>₂ may be used as input for the initial value for the VPGMtrimming scheme described in FIG. 13 and FIG. 14. The tests described inSTEP 520 of FIG. 13 and STEP 610 of FIG. 14 require an initial valueV_(PGM0) _(—) 0 for the starting programming voltage. If this value isset too low, the test will have to cycle through more steps before theword line is program-verified. This will be inefficient and consumingmore erase cycles of the memory device. On the other hand if the valueis set too high, the word line may be over-programmed.

V_(PGM) Trimming Weighted Toward Faster Programming Pages

According to another aspect of the invention, in a memory array havingmultiple erasable blocks, each block having a group of word lines withsimilar type of programming characteristics, a scheme for obtaining anoptimum starting programming voltage of a representative page of thegroup includes: forming samples over a set of blocks with one or moreword line from a geometrically similar location of each block of theset, obtaining a statistic estimation of a programming voltage from eachsample of the set, and selecting a minimum estimation among the set toderive the optimum starting programming voltage. In this way, theoptimum value is weighted towards the faster programming word lines forthat group since they require a lower programming voltage compared tothe slower ones.

The programming voltage trimming schemes described in other sectionsexamine a page at a time as to whether all bits in the page areprogram-verified or not. This implies the test results are catering tothe slower programming bits, as these slower bits must also beprogram-verified before the whole page is deemed program-verified. Theconsequence is that the starting voltage may be over estimated for thefaster programming bits with the danger of over-programming. The presentsampling and statistical computational scheme allows a lowest value tobe selected for the set of starting voltages that was derived from ascheme biased towards the slower programming bits.

By geometrically similar location, it is understood that there arecertain symmetries in the layout of the physical memory array.Structures belonging to the same symmetry group would have very similarcharacteristics. Referring to FIG. 6 for example, WL2 to WL13 form agroup of word lines in the core region of an erase block with somewhatsimilar but not identical type of programming characteristics. A set ofblocks is for example, from block0 to block127. The samples are formedby selecting a word line from a geometrically similar location of eachblock of the set. Thus, a first sample would be constituted from WL2from block0, WL18 from block1, WL34 from block2, . . . , WL1034 fromblock127. A second sample would be constituted from WL3 from block0,WL19 from block1, WL35 from block2, . . . , WL1035 from block127. All inall there will be a set of 128 samples. V_(PGM) trimming operations canbe performed on each of the samples and therefore 128 statisticalresults (e.g. <V_(PGM0)> will be obtained. The present method calls forselecting a smallest one among the 128<V_(PGM0)>s.

FIG. 18 is a flow diagram illustrating the scheme of obtaining a V_(PGM)trimmed value that is weighted toward the faster programming word lines.

STEP 900: Providing a non-volatile memory having an array of memorycells that is organized into erasable blocks, each erasable blockcontaining a block of word lines for accessing memory cells that areerasable together, and each word line containing at least one page ofmemory cells that are programmable together.STEP 902: Selecting a group of pages representative of the page within ablock.STEP 904: Selecting a set of blocks.STEP 906: Forming a set of samples by selecting at least a page fromeach block, the page being located in a geometrically similar locationof each block.STEP 908: Obtaining a statistical estimation of a programming voltagefrom each sample of the set.

STEP 910: Determining the starting programming voltage for the page byselecting a minimum statistical estimation among the set. The scheme oftesting individual samples formed by selecting at least a similar pagefrom each block also has the advantage of minimum storage requirement.After each sample is tested, a test result in the form of an average isobtained and stored. Then the next sample is tested in similar mannerand its average is then compared to the first one in storage. Whicheveraverage is the lower one will be retained in storage so that only onedata need be stored as the set of samples is processed a sample at atime.

Also, with the sample formed by selecting a relatively small portionfrom each of the blocks, another advantage is that the sample average isnot as sensitive to the presence of any bad blocks where a large portionof the word lines in it may be defective.

All patents, patent applications, articles, books, specifications, otherpublications, documents and things referenced herein are herebyincorporated herein by this reference in their entirety for allpurposes. To the extent of any inconsistency or conflict in thedefinition or use of a term between any of the incorporatedpublications, documents or things and the text of the present document,the definition or use of the term in the present document shall prevail.

Although the various aspects of the present invention have beendescribed with respect to certain embodiments, it is understood that theinvention is entitled to protection within the full scope of theappended claims.

1. A non-volatile memory comprising: an array of memory cells that isorganized into erasable blocks, each erasable block containing a blockof word lines for accessing memory cells that are erasable together, andeach word line containing at least one page of memory cells that areprogrammable together; a designated sample of pages representative ofthe given page within a block; an associated programming voltage forprogramming each page of the sample, the associated programming voltagehaving a staircase waveform with an initial value and a predeterminedmaximum voltage limit; a built-in self testing module for determining astarting programming voltage for a given page, said module providingmemory operations including: (a) erasing the sample of pages; (b) forevery page in the sample, programming and verifying the page step bystep of the staircase waveform until either the maximum voltage limithas been reached or the page has been programmed to a target pattern andin which case a final programming voltage is accumulated as part of agathered statistics; and (c) providing the gathered statistic forcomputing an average starting programming voltage for the sample forderiving a starting programming voltage for the given page; and wherein:the sample is part of larger sample taken from a plurality of blocks inthe memory array; and the gathered statistics includes accumulatedinitial values associated with programmable pages from the plurality ofblocks.
 2. The memory as in claim 1, wherein the nonvolatile memory isof the NAND type.
 3. The memory as in claim 1, wherein individual memorycells each stores more than one bit of data.
 4. A non-volatile memorycomprising: an array of memory cells that is organized into erasableblocks, each erasable block containing a block of word lines foraccessing memory cells that are erasable together, and each word linecontaining at least one page of memory cells that are programmabletogether; a designated sample of pages representative of the given pagewithin a block; an associated programming voltage for programming eachpage of the sample, the associated programming voltage having astaircase waveform with an initial value and a predetermined maximumvoltage limit; a built-in self-testing module for determining a startingprogramming voltage for a given page, said module providing memoryoperations including: (a) erasing the sample of pages; (b) for everypage in the sample, programming and verifying the page step by step ofthe staircase waveform until either the maximum voltage limit has beenreached or the page has been programmed to a target pattern and in whichcase a final programming voltage is accumulated as part of a gatheredstatistics; and (c) providing the gathered statistic for computing anaverage starting programming voltage for the sample for deriving astarting programming voltage for the given page; and wherein said moduleprovides memory operations further including: forming a set of samplesby selecting a page from each block, the page being located in ageometrically similar location of each block; obtaining a statisticalestimation of a programming voltage from each sample of the set; andwherein the step of computing an average starting programming voltagefor the sample is by selecting a minimum statistical estimation amongthe set.
 5. The memory as in claim 4, wherein the nonvolatile memory isof the NAND type.
 6. The memory as in claim 4, wherein individual memorycells each stores more than one bit of data.